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THE CURIOUS CASE OF LYMAN ALPHA EMITTERS: 
GROWING YOUNGER FROM z - 3 TO z - 2? 
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ABSTRACT 

Lyman Alpha Emitting (LAE) galaxies are thought to be progenitors of present-day L* galaxies. 
Clustering analyses have suggested that LAEs at z ^ 3 might evolve into LAEs at z ~ 2, but it 
is unclear whether the physical nature of these galaxies is compatible with this hypothesis. Several 
groups have investigated the properties of LAEs using spectral energy distribution (SED) fitting, but 
direct comparison of their results is complicated by inconsistencies in the treatment of the data and 
in the assumptions made in modeling the stellar populations, which are degenerate with the effects of 
galaxy evolution. By using the same data analysis pipeline and SED fitting software on two stacked 
samples of LAEs at z = 3.1 and z = 2.1, and by eliminating several systematic uncertainties that 
might cause a discrepancy, we determine that the physical properties of these two samples of galaxies 
are dramatically different. LAEs at z = 3.1 are found to be old (age ^ 1 Gyr) and metal-poor 
{Z < 0.2Zq), while LAEs at z = 2.1 appear to be young (age ~ 50 Myr) and metal-rich {Z > Zq). 
The difference in the observed stellar ages makes it very unlikely that z=3.1 LAEs evolve directly into 
z=2.1 LAEs. Larger samples of galaxies, studies of individual objects and spectroscopic measurements 
of metallicity at these redshifts are needed to confirm this picture, which is difficult to reconcile with 
the effects of 1 Gyr of cosmological evolution. 



1. INTRODUCTION 

Lyman Alpha Emitting (LAE) galaxies have been 
shown to be building blocks of Milky- Way type galaxies 
today (|Gawiser et ai.1 120071 iGuaita et all [20Tol ). The 
brightness of the Lyman Alpha line allows one to 
detect these galaxies using the narrow-band technique 
even when the continuum is faint. As a result, large 
samples of LAEs have been studi ed using SEP fittini 
at a variety of re ds hifts (e.g.. iGawiser et al.l 1200 . 
Nilsson et ahl I2007L IGawiser et a l.l 1 20071 [Lai et al.| 

2007, 
2010a. 



20071 IPirzkal et al.l HoOtT 



Finkelstein et al. 
Lai et al.l 120081 iPentericci et al.n200aiPno et al, 



Ono et al.ll2010bl. lOuchi et al.1 120101 iNilsson et al.l "l20Tl 
Guaitaetal.1 I2011L iFinkelstein et al.l l2011bD . The 
picture emerging from these collective studies is far from 
homogeneous. While LAEs were once thought to be 
young, dust-free galaxies experiencing their first episode 
of star formation, many of these investigations suggest 
a large spread in their physical properties, and find 
evidence of an older, more evolved, and dustier stellar 
population component. 

An important caveat in the interpretation of re- 
sults comes from the systematic uncertainties intro- 
duced by inconsistent modeling of the stellar populations 
among different groups. This issue was studied in de- 
tail by lAcauaviva e t al.' (2011") , where we showed that 
the use of BC03 jB ruzual & Chariot 2003) and CBll 
(jCharlot fc Bruzual 1201 If ) stellar population synthesis 
(SPS) templates, at Solar or variable metallicity, and 
with or without including nebular emission, gave rise to 
a scatter in the estimate of age and mass significantly 
larger than the statistical uncertainty for the same data. 
Further scatter might be created by the use of a different 
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initial mass function (IMF) or of inconsistent statistical 
estimators of the physical properties of galaxies, such as 
the best-fit parameters as opposed to the mean of the 
probability distribution employed by Bayesian statistics. 

2. OBJECTIVES, METHODOLOGY, AND DATA 

This paper focuses on one particular puzzle, the ages 
of Lyman Alpha Emitters, and one particular redshift 
range, the Gyr between z ~ 3 and z ^ 2. LAEs have 
been mostly studied through stacking analyses, because 
their faint continua make it hard to fit the bro a d-band 
pho tometry of each galaxy, although lOno et "ahl (|2010aD 
and INilsson et a l. (2011) have also attempted to fit the 
SEDs of bright individual o bject s at these redshifts. At 
z ~ 3. IGawiser et al.l (|2007D and lLai et all (p008) (here- 
after LOS) examined the SEDs of average-stacked LAEs, 
segregated according to their detection in the IRAC 
3.6 /im band, and found that both samples were essen- 
tially dust-free, and that LAEs Undetected in IRAC were 
fairly young (age ^ 0.15 Gyr) and LAEs Detected in 
IRAC were o l d (ag e ~ 1.6 Gyr). At the same redshift, 
INilsson et al.l (I2007D also foun d an old median age of 0.85 
Gvr. while lOno et al.l (|2010aD found LAEs to be typically 
young (age < 0.1 Gyr), although a few of the individu- 
ally fitted K-de tected objects appea red to be older. At 
redshift z ~ 2, IGuaita et"all (j2011[) revealed the LAEs 
to be young (age < 0.1 Gyr), and did not find a notice- 
able difference in the age of LAEs when classified and 
stacked on th e basis o f their IRAC 3.6 /i m band lu- 
minosity. Nils son et al.l ()2011|) reported finding a young 
(age < 0.05 Gyr) and an old (age ^ 1 Gyr) stellar pop- 
ulation, of comparable mass, in z = 2.3 LAEs, although 
the authors did not report whether the two-component 
fit was statistically favored with respect to a fit with a 
single stellar population. 

The purpose of our work is to compare the properties 
of LAEs at z ~ 3 and z ~ 2 by using exactly the same 
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Fig. 1. — Data points and best fit spectra for the full LAE sam- 
ples. The black diamonds show the flux predicted by the best-flt 
model in each band. The Ly— a line has been subtracted from the 
photometry and is not included in the templates used to flt the 
SED. In the top panel, data at 2 = 3.1 have laeen offset to account 
for the different luminosity distance. The dip in the SED at rest- 
frame wavelengths ~ 1 fim is typical of low-metallicity, old stellar 
populations. 

pipeline to compose the stacked samples and employing 
the same, state-of-the-art algorithm to analyze the SEDs. 
This procedure will discriminate between physical evo- 
lution of LAEs and artificial differences introduced by 
inconsistencies in building the stacks and modeling the 
stellar populations. 

The LAE s tacks at z = 2.1 are those presented in 
iGuaita et al.l (|2011[ ). hereafter Gull. We consider the 
full sample, which includes 216 objects, as well as the 
IRAC-bright and IRAC-dim subsamples, which were 
built using flux cuts corresponding to the IRAC detection 
criterion used by LOS, appropriately rescaled to account 
for the difference in luminosity distance. These samples 
were composed by median stacking, which is less sensi- 
tive to outliers than average stacking. For this reason, we 
also revisit the Detected and Undetected stacks of LOS 
and use median stacking, and we compose a new median- 
stacked SED (the full s ample a t z = 3.1) that comprises 
70 LAEs from Gronwal fetall ()2007l ). At both redshifts, 
LAEs are selected based on rest-frame Lya equivalent 
width > 20 and on the flux in the Ly-a line. The flux 
limits used in the two surveys lead to very similar rest- 
frame Ly-a luminosity limits, making the samples at 
z = 3.1 and z = 2.1 suitable for a comparative study 
(jCiardullo et al.l l2012D. At z = 3.1. we h ave UBVRIzJK 
data from MUSYC (|Gawiser et al.ll2006[ ) and IRAC pho- 
tometry, while at z = 2.1 we also have H band data. The 
photometry is computed using IRAF; we use an aperture 
radius of 1 arcsec in the UV, optical and NIR bands, and 
1.25 arcsec in the IRAC bands. The fluxes are aperture- 
corrected to estimate the total fluxes as described in 



iGawiser et al.l ()2006[ ). For both samples, objects found 
in crowded IRAC regions, where imperfect subtraction of 
bright nearby neighbors would bias the photometry, are 
discarded before the stacking. We cross-checked that the 
photometric data obtained using SExtractor are compat- 
ible with those computed through IRAF. Errors on the 
stacked photometry are determined by a bootstrap pro- 
cedure, which accounts for both the photometric error 
and the scatter resulting from the spread in the SEDs 
of different galaxies. At both redshifts, the latter is the 
predominant cause of uncertainties; as a result, the size 
of the error bars and the number of objects in each stack 
are effectively uncorrelated. 

We us e the Markov Chain Monte Carlo code 
GalMC (jAcauaviva et al.l 1201 If ) to flt the spec- 
tral energy distribution of t hese six sample s, and 
the "GetDist" software from iLewis fc Bridie ()2002l ) 
to analyze the chains. The data are made avail- 
able on the MUSYC public data release website at 
jhttp: / / physics.rutgers.edu/ ^gawiser/MUSYC / data.html| 

3. RESULTS 

The SEDs of the median-stacked full samples at red- 
shifts z = 3.1 and z = 2.1 are plotted in Fig. [TJ We 
begin by fltting four parameters: age since the beginning 
of star formation (required to be lower than the age of the 
Universe at that redshift), dust content parametrized by 
the excess color E(B-V), stellar mass (which is the in- 
tegral of the star formation rate minus the mass loss 
due to the life cycle of stars), and n ietallicity (using 
a flat prior in log Z, as discussed in lAcauaviva et al.l 
[20T1. We use the latest CBll models, use the Calzetti 
law ([Calzetti et al.l [2000l) to compute the attenuation, 
assume constant star formation history, favored by the 
previous analyses of LOS and Gull, and adopt a Salpeter 
IMF. We inc lude the contribu t ion of nebular emission as 
described in lAcauaviva et al.l (120 111), fol l owing the pro- 
cedure outlined bv lSchaerer fc de BarrosI (120091 ) . Results 
from SED fitting are shown in Figs. [2] and [3] and sum- 
marized in Table [T] 

Unsurprisingly, given the different shapes of their 
SEDs, we find strong differences in the properties of 
z — 3.1 and z — 2.1 Lyman Alpha Emitters. The 
higher-redshifts LAEs are dust-free, have strongly sub- 
Solar metallicities, have mean masses of ~ 1.5 x 10^ M©, 
and are significantly older, with mean ages of 1 Gyr, than 
their lower-redshift counterparts, which present a very 
moderate amount of dust, appear to have Solar metallic- 
ity, have mean masses of 3 x 10* M0, and have mean 
ages of 45 Myr. 

Results for the fuU, IRAC-bright, and IRAC-dim 
stacked samples of LAEs at z — 2.1 can be compared 
directly to the ones for the same stacks of Gull; the 
main difference is the lower amount of dust (E(B-V) ~ 
0.11 vs 0.22 for the full sample) found by this analysis, 
which is a direct consequence of the inclusion of neb- 
ular emission lines in our code. We can also compare 
our median-stacked Detected and Undetected samples to 
the average-stacked Detected and Undetected samples of 
LOS. In this case the main difference is the older age es- 
timated for the Undetected sample, from the 160 Myr of 
LOS to the 450 Myr found by us. This difference comes in 
part from the median stacking, which alters the shape of 
the SED slightly, enhancing the flux in the IRAC 3.6 and 
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Sample 


z 


Notes 


Z/Zo 


Age (Gyr) 


E(B-V) 


M* (10^ M/M©) 


best fit xVd.o f. 


full 


3.1 




0.025 [0.005-0.04] 


0.98 [0.84-2.0] 


0.019 [0.-0.022] 


16 [13-19] 


11.9/8 


full 


3.1 


fixed Z 


0.2 


0.7 [0.6-0.87] 


0.015 [0-0.02] 


13 [12-18] 


14.7/9 


full 


3.1 


ESF, fixed Z 


0.02 


0.97 [0.76-2.1] 


0.025 [0-0.031] 


15 [12-18] 


9.5/8 


full 


3.1 


2 Pop, fixed Z 


0.02 


0.45 [0.12-2.1] 


0.0195 [0-0.024] 


15 (37 [0-54]) 


8.1/8 


full 


2.1 




1.6 [1.4-2.1] 


0.05 [0.02-0.1] 


0.12 [0.1-0.15] 


3.2 [1.9-5.5] 


20.0/9 


full 


2.1 


fixed Z 


0.2 


0.09 [0.06-0.13] 


0.14 [0.12-0.16] 


6.4 [5.2-7.7] 


33.0/10 


full 


2.1 


ESF, fixed Z 


1.0 


0.07 [0.04-0.11] 


0.11 [0.09-0.14] 


5.3 [3.7-7.2] 


19.7/9 


full 


2.1 


2 Pop, fixed Z 


1.0 


0.05 [0.03-0.1] 


0.12 [0.09-0.14] 


3.7 (0.07 [0-0.2]) 


19.7/9 


IRAC Det 


3.1 




0.05 [0.005-0.1] 


1.0 [0.93 - 2.1] 


0.05 [0-0.06] 


56 [45-69] 


9.7/8 


IRAC Bright 


2.1 




0.2 [0.03-1.2] 


0.03 [0.01-0.08] 


0.29 [0.27-0.32] 


17 [9-39] 


19.2/9 


IRAC Und 


3.1 




0.025 [0.005-0.04] 


0.45 [0.4-0.6] 


0.024 [0-0.03] 


6.7 [5-8.5] 


19.5/8 


IRAC Dim 


2.1 




2.7 [2.3-5.0] 


0.07 [0.02 -0.2] 


0.03 [0 - 0.04] 


1.8 [1.2-2.8] 


22.7/9 



TABLE 1 

Mean expectation values and 68% credible intervals from BED fitting for the six different samples considered in this 
WORK. For the two-population fit, we report the mean value of the total mass and the ratio between the mass in the old 
stellar, population and the total mass and its uncertainty in parentheses. 
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Fig. 2. — Marginalized probability distributions for the SED fitting parameters for z = 3.1 (blue, dashed), and z = 2.1 (black, solid) full 
LAE samples. Curves are normalized to have the same peak height. 

4.5 /im bands by a factor of 1.8, and in part from ahowing 
metalhcity to vary and finding Z/ Zq < 0.2. The quoted 
LOS results assumed Solar metalhcity, although the au- 
thors reported, consistent with our findings, that their 
age estimate for the Undetected population increased if 
Z = 0.2 Zq was used. 

4. THE MYSTERY OF THE AGES OF LAES 

The evolutionary picture of LAEs between z 3 and 
z ~ 2 suggested by our results is quite puzzling. The 
same type of galaxies that are found to be old and metal- 
poor at z ~ 3 appear to be young and metal-rich only a 
Gyr later. Before trying to reflect on the implications 
of this scenario, we consider several possible artificial 
sources of such a sharp dichotomy. 

4.1. Data quality tests 

We begin by examining in detail the stacked SEDs of 
Fig. [TJ At z = 3.1, the best-model has a very reasonable 
reduced of 1.4. This is a desired property because the 
stacking procedure relies on the strong assumption that 
"the typical LAE" exists, and that the artificial SED 
built by stacking many real SEDs is a reasonable rep- 
resentation of the spectrum of this typical LAE galaxy. 
A reasonable reduced value suggests that the boot- 
strap uncertainties correctly account for the spread in 
the SEDs of LAEs and does not illuminate a template 
incompleteness problem. Nonetheless, there is a relevant 
amount of scatter in the stacked SED, with three points 
particularly standing out, the low measurements in the 



J and IRAC 5.8 /im band, and the high measurement in 
the K band. We re-did the SED fitting excluding each of 
them, and each pair, in turn, and we found that our re- 
sults are extremely robust to the exclusion of these data 
points, with changes in the parameter estimates by less 
than 0.2cr. We also note that the results at 2 = 3.1 
for both age and me talhcity are in good agreement with 
those of INilsson et al. (2007), and that the analysis of 
lOno et alT i 2010aj) favors low metalhcity values. Strongly 



sub-Solar r netallicities were also fo und in Damped Lya 
systems bv iProchaska et all ()2003D . 

At z = 2.1, the SED visually appears to be smoother; 
however, its reduced has a slightly higher value of 
2.2. This feature suggests that the stacking at this red- 
shift might be an overly aggressive compression of the 
information contained in the individual SEDs. Breaking 
down the sample in IRAC-bright and IRAC-Dim LAEs 
in the attempt to increase the homogeneity of the stacks 
does not afford a better fit, although it shows that the 
properties of these two stacks follow closely the ones of 
the full sample. 

4.2. Metallicity 

The SED fitting parameters are correlated with each 
other, and in particular, there is a well-know age- 
metallicity relation that can be seen easily as the axis 
of the 2-D credible region in the age-metallicity plane of 
Fig. [31 Our estimates of metallicity depend on the SPS 
templates we use and (although weakly) on the flat prior 
on log Z that we assume. We think that the latter is 
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Fig. 3. — 2-dimensional marginalized contours illustrating the correlations between the most correlated parameters. The solid lines 
show the 68 and 95% error regions for the parameters; the color scale traces the likelihood. For flat priors, lack of exact overlap signals 
non-Gaussianity of the posterior distribution. 

tion histories, as can be seen in Table [1] The data favor 
a nearly constant SFH at both redshifts, with a best-fit 
value of T around 4 Gyr for both samples. Gull had 
also performed the SED fitting using these SFHs for the 
z = 2.1 sample, albeit with a different algorithm, and 
had found similar results. 



physically motivated, and we have no reason not to trust 
the models; however, it is possible that the observed dif- 
ference in metallicity between LAEs at the two redshift 
is overestimated. If the true metallicity at z = 3.1 were 
higher, and the true metallicity at z — 2.1 were lower, 
this might help reconcile the gap in the physical proper- 
ties of these galaxies. To test this hypothesis, we run our 
SED fitting code on the full samples at the two redshifts, 
holding the metallicity fixed at Z = Q.2Zq\ this value is 
suggested by the spectroscop i c analy sis of three LAEs at 
z ^ 2.3 by 'Finkels tein et al.l ()2011q ). Results are shown 
in Fig. |4] and listed in Table [1] As expected, the esti- 
mates for the mean ages of the stellar population shift in 
the desired direction, but remain incompatible at several 
sigma level. This result seems to exclude metallicity as 
the primary source of the gap in the age of LAEs, at least 
in this stacking analysis. 

4.3. Star Formation History 

The assumption of a particular functional form for the 
star formation history of LAEs can also infiuence the 
determination of ages. We investigate this issue by re- 
peating the SED fitting procedure on the full samples 
using exponentially increasing and decreasing star for- 
mation histories (SFHs), o<; e^*/'^. By sampling in 
l/r, both increasing and decreasing SFHs are explored 
as part of a contiguous p arameter space, as explained 
in lAcquaviva et al.l ()2011[ ) . This parametrization is also 
able to capture constant or starburst SFHs for appro- 
priate values of r/age. The results of this SED fitting 
run (where the metallicities were held fixed at the best- 
fit value for both samples) are shown in Fig. |31 in both 
cases the estimates of ages do not change significantly. 
The quality of the fit does not improve significantly in 
either case by allowing this larger range of star forma- 



4.4. A hidden old stellar population 

A possible explanation of the difference in the phys- 
ical nature of LAEs at z = 3.1 and z = 2.1 might be 
the presence of stellar components of different ages. We 
test this hypothesis by performing the SED fitting on 
the full samples using two stellar populations, fixing the 
age of the second stellar population at 1 Gyr, and intro- 
ducing the stellar mass of the second stellar population 
as a free SED fitting parameter. We show the results of 
this test for a single-burst model that would solely con- 
tain old stars, although we have also considered constant 
or linearly increasing SFHs for the second stellar pop- 
ulation, as suggested by some re cent observations (e.g., 
iFinkelstein et al.]|2011l : lLeell2010l and references therein). 
For z = 2.1 LAEs, we also consider a few different values 
of metallicity and dust content for the second popula- 
tion. At z = 2.1, this test is meant to reveal an older 
stellar population; at z = 3.1, it may allow us to sin- 
gle out two components of different ages, although we do 
not expect to find a large component of very young stars, 
since they would dominate the rest-frame UV part of the 
total spectrum. In the case of z = 3.1 LAEs, adding a 
second old stellar population slightly improves the qual- 
ity of the fit (from Xr = 1-4 to Xr = l-O)- Young ages 
for the primary stellar population are allowed, but only 
when the fraction of mass in this stellar population is 
small, as seen in the third panel of Fig. |4l For example, 
ages of the order of 100 Myr are allowed at 68% confi- 
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Fig. 4. — Left two panels: Variation in the posterior probability distribution of age of the full samples, for different assumptions described 
in the text. The black (solid, thick) line is the reference case ran with CSF and varying metallicity. The red (dotted-dashed) line is for 
Z fixed at 0.2Zq, the green (dashed) line corresponds to exponential star formation, and the magenta (solid, thin) line is the age of the 
primary population when a second, 1 Gyr old SSP is added. Right panels: Probability distribution of the fraction of mass in the old stellar 
population versus the age of the primary population. For the z = 3.1 LAEs, a young stellar component is allowed only when most of the 
mass is in the old stellar population. 

dence when the mass in young stars is less than 10% of 
the total stellar mass. As expected since the two stellar 
populations are so similar, the mass ratio in the two pop- 
ulations is unconstrained. For z = 2.1 LAEs, the quality 
of the fit does not improve by adding a second stellar 
population, and again the age estimate for the first stel- 
lar population does not change from the value of ^ 50 
Gyr favored by the single population fit, and the mass 
fraction in the "Old" stellar population is found to be 
consistent with zero, and < 50% at 2a confidence for all 
cases considered. We conclude that the two-component 
scenario cannot be confirmed by analyzing the stacked 
SEDs, although the improvement in the ^ — 3-1 

might indicate a heterogeneity of the stellar populations, 
masked by the stacking process. 



5. CONCLUSIONS 

The clustering analysis of LAEs (jGuaita et al.ll20l"ol) 
has shown that LAEs at z ^ 3 and z ~ 2 are hosted 
by dark matter halos that can be in a direct progenitor- 
descendant relationship. An intriguing possibility is that 
they might represent two different stages of the same 
galaxy population. We test this hypothesis by using the 
same data analysis pipeline and SED fitting software; 
this strategy allows us to discriminate between evolution 
in the galaxy properties and the effects of different as- 
sumptions in the processing of data and the modeling of 
the stellar populations. We find that the physical prop- 
erties of LAEs at z = 3.1 are very different from those 
of LAEs at z = 2.1, which appear to be several times 
younger and more metal-rich than their higher-redshift 
counterparts. If these samples are a fair representation 



of LAEs at each redshift, this result directly rules out the 
hypothesis that z=3.1 LAEs evolve into z=2.1 LAEs, im- 
plying that the LAE phase lasts significantly less than 1 
Gyr. Moreover, such stark differences between emission- 
line selected galaxy samples become difficult to explain 
as the effect of 1 Gyr of cosmological evolution. 

We find hints that stacking analysis, meant to provide 
insights on the nature of the typical LAE galaxy, might 
be overly aggressive and perhaps unable to capture the 
multi-component nature of LAEs at each redshift. To 
be properly investigated, these issues require constraints 
from SED fitting of large samples of individual objects, 
as well as spectroscopic constraints on the metallicity of 
LAEs. The former have been especially challenging to 
acquire so far, because of the lack of deep data in the 
observed NIR region of the spectrum. This gap is being 
filled by deep YJH HST observations by the Cosmic As- 
sembly Ncar-infrarcd Deep Extragalactic Survey (CAN- 
DELS, lGrogineLaUl2fllI|; |KoeW and 
we plan to address this issue by using CANDELS data 
in the MUSYC fields in a subsequent paper (Vargas et 
al 2012, in preparation). 
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